Information content and word frequency in natural language: Word length matters
نویسندگان
چکیده
منابع مشابه
Information content and word frequency in natural language: word length matters.
For centuries, scientists have attempted to uncover commonalities that underlie the structure of human languages (1). In a recent issue of PNAS, Piantadosi et al. (2) reported an exciting finding with respect to one unique type of language universal. The authors empirically demonstrated that word length strongly correlated with information content across 11 distinct natural languages. This find...
متن کاملWord Length Andword Frequency
Since the appearance of Zipf’s works, (esp. Zipf 1932, 1935), his hypothesis “that the magnitude of words tends, on the whole, to stand in an inverse (not necessarily proportionate) relationship to the number of occurrences” (1935: 25) has been generally accepted. Zipf illustrated the relation between word length and frequency of word occurrence using German data, namely the frequency dictionar...
متن کاملWord-length entropies and correlations of natural language written texts
We study the frequency distributions and correlations of the word lengths of ten European languages. Our findings indicate that a) the word-length distribution of short words quantified by the mean value and the entropy distinguishes the Uralic (Finnish) corpus from the others, b) the tails at long words, manifested in the high-order moments of the distributions, differentiate the Germanic lang...
متن کاملInformation content versus word length in natural language: A reply to Ferrer-i-Cancho and Moscoso del Prado
Recently, Ferrer i Cancho and Moscoso del Prado Mart́ın (2011) argued that an observed linear relationship between word length and average surprisal (Piantadosi, Tily, & Gibson, 2011) is not evidence for communicative efficiency in human language. We argue that their study of a random typing model is largely irrelevant to human language: their model critically rests on incorrect assumptions abou...
متن کاملWord-Forming Process in Azeri Turkish Language
The subject intended to study the general methods of natural word-forming in Azeri Turkish language. This study aimed to reach this purpose by analyzing the construction of compound Azeri Turkish words. Same’ei (2016) did a comprehensive study on word-forming process in Farsi, which was the inspiration source of this study for Azeri Turkish language word-forming. Numerous scholars had done vari...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the National Academy of Sciences
سال: 2011
ISSN: 0027-8424,1091-6490
DOI: 10.1073/pnas.1103035108